Bayesian Conditional Density Filtering for Big Data

Authors

  • Rajarshi Guhaniyogi
  • Shaan Qamar
  • David B. Dunson
Abstract

We propose a Conditional Density Filtering (C-DF) algorithm for efficient online Bayesian inference. C-DF adapts Gibbs sampling to the online setting, sampling from approximations to conditional posterior distributions obtained by tracking surrogate conditional sufficient statistics as new data arrive. This tracking eliminates the need to store or process the entire data set simultaneously. We show that C-DF samples converge to the exact posterior distribution asymptotically, as sampling proceeds and more data arrive over time. We provide several motivating examples and consider an application to compressed factor regression for streaming data, illustrating performance competitive with batch algorithms that use all of the data.
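The bookkeeping described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a conjugate normal linear regression with a N(0, tau2 I) prior on the coefficients and an inverse-gamma(a, b) prior on the noise variance, and every name in it (`online_gibbs_linear_regression`, `tau2`, `a`, `b`, `n_gibbs`) is hypothetical. In this simple model the tracked statistics happen to be exact sufficient statistics; C-DF in general tracks surrogate statistics that also depend on current parameter estimates.

```python
import numpy as np

def online_gibbs_linear_regression(batches, p, n_gibbs=20,
                                   tau2=100.0, a=1.0, b=1.0, seed=0):
    """Gibbs sampling for y = X @ beta + noise, fed one batch at a time.

    Only running summaries (S_xx, S_xy, S_yy, n) are kept; each batch can
    be discarded after it has been folded into them.
    Assumed priors (illustrative): beta ~ N(0, tau2 * I), sigma2 ~ IG(a, b).
    """
    rng = np.random.default_rng(seed)
    S_xx = np.zeros((p, p))   # running sum of x x^T
    S_xy = np.zeros(p)        # running sum of x * y
    S_yy = 0.0                # running sum of y^2
    n = 0                     # number of observations seen so far
    sigma2 = 1.0
    draws = []

    for X, y in batches:
        # Fold the new batch into the tracked statistics.
        S_xx += X.T @ X
        S_xy += X.T @ y
        S_yy += float(y @ y)
        n += len(y)

        for _ in range(n_gibbs):
            # beta | sigma2, statistics  ~  N(m, V)
            V = np.linalg.inv(S_xx / sigma2 + np.eye(p) / tau2)
            V = (V + V.T) / 2.0            # guard against round-off asymmetry
            m = V @ (S_xy / sigma2)
            beta = rng.multivariate_normal(m, V)

            # sigma2 | beta, statistics  ~  IG(a + n/2, b + RSS/2),
            # with the residual sum of squares rebuilt from the summaries.
            rss = S_yy - 2.0 * beta @ S_xy + beta @ S_xx @ beta
            rate = b + max(rss, 0.0) / 2.0
            sigma2 = 1.0 / rng.gamma(a + n / 2.0, 1.0 / rate)

            draws.append((beta.copy(), sigma2))
    return draws
```

Calling the function with an iterable of `(X, y)` batches returns the list of `(beta, sigma2)` draws; memory use depends on the number of predictors `p`, not on the number of observations processed.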

Similar resources

Bayesian Conditional Density Filtering

We propose a Conditional Density Filtering (C-DF) algorithm for efficient online Bayesian inference. C-DF adapts MCMC sampling to the online setting, sampling from approximations to conditional posterior distributions obtained by propagating surrogate conditional sufficient statistics (a function of data and parameter estimates) as new data arrive. These quantities eliminate the need to store o...

United Statistical Algorithms, Small and Big Data, Future of Statistician

Role of big idea statisticians in future of Big Data Science. United Statistical Algorithms framework for comprehensive unification of traditional and novel statistical methods for modeling Small Data and Big Data, especially mixed data (discrete, continuous). Goal: Model (X, Y) by nonparametrically estimating conditional mean E[Y | X = x] and conditional quantile Q(u; Y | X = x). Modeling exampl...

Bayesian Prediction Intervals under Bivariate Truncated Generalized Cauchy Distribution

Ateya and Madhagi (2011) introduced a multivariate form of the truncated generalized Cauchy distribution (TGCD), which was introduced by Ateya and Al-Hussaini (2007). The multivariate version of TGCD is denoted MVTGCD. Among the features of this form are that subvectors and conditional subvectors of random vectors, distributed according to this distribution, have the same form of distribution ...

Generalised Filtering

We describe a Bayesian filtering scheme for nonlinear state-space models in continuous time. This scheme is called Generalised Filtering and furnishes posterior conditional densities on hidden states and unknown parameters generating observed data. Crucially, the scheme operates online, assimilating data to optimize the conditional density on time-varying states and time-invariant parameters. I...

Variational filtering

This note presents a simple Bayesian filtering scheme, using variational calculus, for inference on the hidden states of dynamic systems. Variational filtering is a stochastic scheme that propagates particles over a changing variational energy landscape, such that their sample density approximates the conditional density of hidden states and inputs. The key innovation, on which variational ...

Journal:
  • CoRR

Volume: abs/1401.3632

Publication date: 2014